Efficient Seeds Computation Revisited

Authors

  • Michalis Christou
  • Maxime Crochemore
  • Costas S. Iliopoulos
  • Marcin Kubica
  • Solon P. Pissis
  • Jakub Radoszewski
  • Wojciech Rytter
  • Bartosz Szreder
  • Tomasz Walen
Abstract

The notion of a cover generalizes that of a period of a string, and linear-time algorithms for finding the shortest cover are known. The seed is a more complicated generalization of periodicity: it is a cover of a superstring of a given string, and the shortest seed problem is of much higher algorithmic difficulty. The problem is not well understood, and no linear-time algorithm is known. In this paper we give linear-time algorithms for some of its versions: computing the shortest left-seed array, computing the longest left-seed array, and checking for seeds of a given length. The algorithm for the last problem is used to compute the seed array of a string (i.e., the shortest seeds for all the prefixes of the string) in O(n^2) time. We also describe a simpler alternative algorithm that efficiently computes the shortest seeds. As a by-product we obtain an O(n log (n/m)) time algorithm that checks whether the shortest seed has length at least m and finds the corresponding seed. We also correct some important details missing in the previously known shortest-seed algorithm (Iliopoulos et al., 1996).
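To make the two definitions in the abstract concrete, here is a minimal brute-force sketch in Python. It is not the paper's algorithm and runs in polynomial, far from linear, time. The function names (`covered_positions`, `is_cover`, `is_seed`, `shortest_seed`) are hypothetical. The sketch also assumes the common convention that a seed must itself be a factor of the string, which the abstract does not state.

```python
def covered_positions(c, s, allow_overhang):
    """Mark positions of s covered by placements of c (illustrative helper).

    With allow_overhang=False only full occurrences inside s count (cover).
    With allow_overhang=True, c may also stick out past either end of s,
    as long as it matches s on the overlapping part (seed).
    """
    n, m = len(s), len(c)
    covered = [False] * n
    start = -(m - 1) if allow_overhang else 0
    stop = n if allow_overhang else n - m + 1
    for p in range(start, stop):
        # Pairs (index in s, index in c) where this placement overlaps s.
        overlap = [(p + j, j) for j in range(m) if 0 <= p + j < n]
        if all(s[i] == c[j] for i, j in overlap):
            for i, _ in overlap:
                covered[i] = True
    return covered


def is_cover(c, s):
    """c covers s if every position of s lies inside an occurrence of c in s."""
    return bool(c) and all(covered_positions(c, s, allow_overhang=False))


def is_seed(c, s):
    """c is a seed of s if c is a factor of s and covers some superstring of s.

    Assumption: the factor requirement follows the usual definition of a seed.
    """
    return bool(c) and c in s and all(covered_positions(c, s, allow_overhang=True))


def shortest_seed(s):
    """Try all factors of s by increasing length and return the first seed."""
    n = len(s)
    for m in range(1, n + 1):
        for i in range(n - m + 1):
            if is_seed(s[i:i + m], s):
                return s[i:i + m]
    return ""
```

For example, "aba" is not a cover of "abaab", but it is a seed of it, because it covers the superstring "abaaba". The quadratic number of candidate factors is exactly the cost that the algorithms described in the abstract are designed to avoid.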


Similar resources


Sweep Line Algorithm for Convex Hull Revisited

The convex hull of a given set of points is the intersection of all convex sets containing them. It is used as a primary structure in many other problems in computational geometry and in other areas such as image processing, model identification, geographical data systems, and triangular computation of a set of points. Computing the convex hull of a set of points is one of the most fundamental and imp...


Gromov-Witten invariants of Fano hypersurfaces, revisited

The goal of this paper is to give an efficient computation of the 3-point Gromov-Witten invariants of Fano hypersurfaces, starting from the Picard-Fuchs equation. This simplifies and to some extent explains the original computations of Jinzenji. The method involves solving a gauge-theoretic differential equation, and our main result is that this equation has a unique solution.


Fast Computation of Good Multiple Spaced Seeds

Homology search finds similar segments between two biological sequences, such as DNA or protein sequences. A significant fraction of computing power in the world is dedicated to performing such tasks. The introduction of optimal spaced seeds by Ma et al. has increased both the sensitivity and the speed of homology search and it has been adopted by many alignment programs such as BLAST. With the...


Article Pre-print Quantisation Revisited: A Mathematical and Computational Model

A nascent theory of near division is presented, from which an efficient quantisation algorithm for rhythm intervals is derived. Based on a number theoretic analysis of the uniqueness and convergence of this first algorithm, a generalised algorithm is presented. An empirical study of the algorithm’s performance reveals a readily computable criterion within which the perceived ratio may reliably ...



Publication date: 2011